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CLAIMS: 

1 . A method for programming forwarding tables for switches for multipathing in a 
subnet of a switched fabric including at least a host system, a target system and switches each 
having one or more ports interconnected via links, said method comprising: 

determining all possible links between all ports on the subnet during topology discovery; 
creating an all port connectivity table which records all port-to-port connectivity 
information; 

creating an all switch shortest paths table which records all the shortest paths between 
every switch pair on the subnet based the port-to-port connectivity information; and 

computing forwarding tables for respective switches on the subnet that allow usage of 
multiple paths between switch pairs based on the port-to-port connectivity information and based 
on the shortest paths between every switch pair. 

2. The method as claimed in claim 1 , further comprising: 

downloading the forwarding tables to respective switches on the subnet that allow usage 
of multiple paths between switch pairs; and 

enabling respective switches on the subnet to route data packets from the host system to 
the target system via mutiple paths through the switched fabric. 
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3. The method as claimed in claim 1 , wherein each of said host system and said 
target system includes a channel adapter (CA) installed supporting one or more ports with each 
port having multiple local identifiers (LIDs) assigned thereto for multipathing. 

4. The method as claimed in claim 3, wherein each port on the subnet supports a 
unique 16-bit LID and a LID Mask Control (LMC) which specifies the number of low order bits 
of the LID to mask when checking a received destination LID against the port's destination LID. 

5. The method as claimed in claim 1 , wherein said forwarding tables are computed 
to ensure loop-less paths and allow ports to be addressed by multiple local identifiers (LIDs), and 
wherein said all-port connectivity and all-switch shortest paths tables are constantly updated 
reflecting any dynamic changes to the subnet topology, 

6. The method as claimed in claim 1, wherein said forwarding tables are computed 
based on the principle that only the shortest path between a given switch pair is guaranteed to 
overlap with other shortest paths that either originate from or destined to some intermediate port 
that exists on the shortest path between the original switch pair. 

7. The method as claimed in claim 1 , wherein a forwarding table for a switch is 
computed by: 
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detennining a destination switch to which a destination port is directly connected, 
identifying all the links that exist between the destination switch and other switches in the 

subnet; 

sorting all the links by respective originating port number in an ascending order; 
picking an appropriate link and identifying the switch to which the link is connected at 
the other end; 

determining the best route between the switch identified and the switch for which the 
forwarding table is being constructed; and 

inputting associated outport number at a designated location in the forwarding table. 

8. The method as claimed in claim 1, wherein said shortest paths between every 
switch pair are computed utilizing an All Pair Shortest Paths (APSP) algorithm, and each shortest 
path from the source to the destination switch is represented by a <Port, Cost> duple in which 
port is the port number of the source switch where the path originates and cost is the path cost 
metric that is computed based on a hop count, a message transfer (MTU) size, a link speed, width 
and other port and link characteristics. 

9. The method as claimed in claim 4, wherein a forwarding table for a switch is 
computed by: 

determining a destination switch to which a destination port is directly connected, 
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identifying all the links that exist between the destination switch and other switches in the 

subnet; 

sorting all the links by respective originating port number in an ascending order; 
picking an appropriate link and identifying the switch to which the link is connected at 
the other end; 

determining the best route between the switch identified and the switch for which the 
forwarding table is being constructed; and 

inputting associated outport number at a designated location in the forwarding table. 

1 0. The method as claimed in claim 4, wherein each of said forwarding tables for 
switches in the subnet is computed, when multiple LIDs are assigned to channel adapter (CA) 
ports, by a multipath assignment algorithm configured to: 

receive information during the topology discovery including the number of multiple paths 
(m) for configuration, the switch LID for which the forwarding table is being built (LID S ), and 
the LID in the forwarding table which is currently identifying the correct outport (LID d ) for 
forwarding data packets; 

determine the base LID (LID dbase ) for the given LID d and determine if the base LID 
(LID dbase ) for the given LID d corresponds to a switch using the all port connectivity table; 

if the base LID (LID dbase ) for the given LID d corresponds to a switch using the all port 
connectivity table, check if the base LID (LID dbase ) for the given LID d corresponds to the switch 
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LID for which the forwarding table is being built (LID S ); 

if the base LID (LID dbase ) for the given LID d corresponds to the switch LID for which the 
forwarding table is being built (LID S ), indicate that there is no route for the given LID d ; 

if the base LID (LID dbase ) for the given LID d happens to be any switch other than the 
switch LID for which the forwarding table is being built (LID S ), check if the base LID (LID dbase ) 
for the given LID d corresponds to the given LID d ; 

if the base LID (LID dbase ) for the given LID d corresponds to the given LID d , set an 
arbitrary LIDx as the base LID (LID dbase ) for the given LID d , and proceed to get the port number 
of switch LID S for the best route between switch LID S and LIDx from the all switch shortest 
paths table; 

if the base LID (LID dbase ) for the given LID d does not correspond to a switch using the all 
port connectivity table, identify the peer node and port to which the port with LID dbase is 
connected from the all port connectivity table and determine if the peer node is a switch; 

if the peer node does not correspond to a switch, indicate that there is no route for the 
given LID d ; 

if the peer node corresponds to a switch, determine if the peer switch LID (LID sdest ) 
corresponds to the switch LID for which the forwarding table is being built (LID S ); 

if the peer switch LID (LID sdest ) corresponds to LID S? determine that the switch LID S and 
port LID dbase are directly connected, and get the appropriate port number of switch LID S from the 
all port connectivity table; 
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if the peer switch LID (LID^,) does not correspond to LID S , identify all the links that are 
directly connected to other switches from the peer switch LID (LID sdest ) and determine the 
number of (N) links that connect switch LID sdest where N is greater than "0"; 

if the number of (N) links is not greater than "0", indicate that there is no route for the 
given LID d ; 

if the number of (N) links is greater than "0", set the offset (n) of LID d from LID dbase , and 
determine if the number of multipaths (m) is less than the offset (n) of LID d from LID dbase and if 
the number of (N) links that connect switch LID sdest to other switch ports is less than the offset (n) 
ofLID d from LID dbase ; 

if either the multipaths (m) or the (N) links is less than the offset (n) of LID d from 
LID dbase , indicate that there is no route for the given LID d ; 

if neither the multipaths (m) nor the (N) links is less than the offset (n) of LID d from 
LID dbase , store the LIDs of switches to which N links are connected in a list O(i) and sort the list 
by the port number of switch LID sdest from where each of N links originate in an ascending order; 

set the list 0(i).LID to the LID of the switch that the link O(n) is connected at the other 
end, and check if 0(i).LID corresponds to the switch LID for which the forwarding table is being 
built (LID S ); 

if the list 0(i).LID corresponds to the switch LID for which the forwarding table is being 
built (LID S ), set an arbitrary LIDx as the LID of the switch to which the port LID d is directly 
connected (LID^) and then get the port number of switch LID S for the best route between 
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switch LID S and LIDx from the all switch shortest paths table; 

if the list 0(i).LID does not correspond to the switch LID for which the forwarding table 
is being built (LID S ), set an arbitrary LIDx as the 0(n).LID, and then get the port number of 
switch LID S for the best route between switch LID S and LIDx from the all switch shortest paths 
table; and 

determine if the LID d is the last LID assigned to a port or switch in the subnet, and 
terminate computation of the forwarding table when LID d is the last LID assigned to a port or 
switch in the subnet. 

11. A data network, comprising: 

a host system having at least one channel adapter (CA) installed therein supporting one or 
more ports with each port having multiple local identifiers (LIDs) assigned thereto for 
multipathing; 

at least one target system having at least one channel adapter (CA) installed therein 
supporting one or more ports with each port having multiple local identifiers (LIDs) assigned 
thereto for multipathing; 

a switched fabric comprising a plurality of different switches which interconnect said host 
system via CA ports to said remote system via CA port along different physical links for data 
communications; and 

a fabric manager provided in said host system for making topology discovery, assigning 



40 



219.39661X00 
LID#: 17318/P11067 

local identifiers (LIDs) to all ports that are connected in the switched fabric, and programming 
forwarding tables for switches in the switched fabric, wherein said fabric manager programs 
forwarding tables for switches for multipathing by: 

determining all possible links between all ports that are connected in the switched 
fabric during topology discovery; 

creating an all port connectivity table which records all port-to-port connectivity 
information; 

creating an all switch shortest paths table which records all the shortest paths 
between every switch pair on the switched fabric based the port-to-port connectivity 
information; and 

computing forwarding tables for respective switches on the switched fabric that 
allow usage of multiple paths between switch pairs based on the port-to-port connectivity 
information and based on the shortest paths between every switch pair. 

1 2. The data network as claimed in claim 1 1 , wherein said fabric manager is 
configured to download the forwarding tables to respective switches on the switched fabric that 
allow usage of multiple paths between every switch pair, and enable respective switches on the 
switched fabric to route data packets from the host system to the target system via multiple paths 
through the switched fabric. 
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13. The data network as claimed in claim 1 1 ? wherein each port on the subnet 
supports a unique 16-bit LID and a LID Mask Control (LMC) which specifies the number of low 
order bits of the LID to mask when checking a received destination LID against the port's 
destination LID. 

1 4. The data network as claimed in claim 1 1 , wherein said forwarding tables are 
computed to ensure loop-less paths and allow ports to be addressed by multiple local identifiers 
(LIDs), and wherein said all-port connectivity and all-switch shortest paths tables are constantly 
updated reflecting any dynamic changes to the subnet topology. 

1 5. The data network as claimed in claim 1 1 , wherein said forwarding tables are 
computed based on the principle that only the shortest path between a given switch pair is 
guaranteed to overlap with other shortest paths that either originate from or destined to some 
intermediate port that exists on the shortest path between the original switch pair. 

16. The data network as claimed in claim 11, wherein said fabric manager is 
configured to compute a forwarding table for a switch by: 

determining a destination switch to which a destination port is directly connected, 
identifying all the links that exist between the destination switch and other switches in the 

subnet; 
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sorting all the links by respective originating port number in an ascending order; 
picking an appropriate link and identifying the switch to which the link is connected at 
the other end; 

determining the best route between the switch identified and the switch for which the 
forwarding table is being constructed; and 

inputting associated outport number at a designated location in the forwarding table. 

1 7. The data network as claimed in claim 1 1 , wherein said shortest paths between 
every port pair are computed utilizing an All Pair Shortest Paths (APSP) algorithm, and each 
shortest path from the source to the destination switch is represented by a <Port, Cost> duple in 
which port is the port number of the source switch where the path originates and cost is the path 
cost metric that is computed based on a hop count, a message transfer (MTU) size, a link speed, 
width and other port and link characteristics. 

1 8. The data network as claimed in claim 1 1 , wherein said fabric manager is provided 
with an algorithm for computing forwarding tables for switches, when multiple LIDs are 
assigned to channel adapter (CA) ports, by: 

receiving information during the topology discovery including the number of multiple 
paths (m) for configuration, the switch LID for which the forwarding table is being built (LID S ), 
and the LID in the forwarding table which is currently identifying the correct outport (LID d ) for 
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forwarding data packets; 

determining the base LID (LID dbase ) for the given LID d and if the base LID (LID dbase ) for 
the given LID d corresponds to a switch using the all port connectivity table; 

if the base LID (LID dbase ) for the given LID d corresponds to a switch using the all port 
connectivity table, checking if the base LID (LID dbase ) for the given LID d corresponds to the 
switch LID for which the forwarding table is being built (LID S ); 

if the base LID (LID dbase ) for the given LID d corresponds to the switch LID for which the 
forwarding table is being built (LID S ), indicating that there is no route for the given LID d ; 

if the base LID (LID dbase ) for the given LID d happens to be any switch other than the 
switch LID for which the forwarding table is being built (LID S ), checking if the base LID 
(LID dbase ) for the given LID d corresponds to the given LID d ; 

if the base LID (LID dbase ) for the given LID d corresponds to the given LID d? setting an 
arbitrary LIDx as the base LID (LID dbase ) for the given LID d , and getting the port number of 
switch LID S for the best route between switch LID S and LIDx from the all switch shortest paths 
table; 

if the base LID (LID dbase ) for the given LID d does not correspond to a switch using the all 
port connectivity table, identifying the peer node and port to which the port with LID dbase is 
connected from the all port connectivity table and determining if the peer node is a switch; 

if the peer node does not correspond to a switch, indicating that there is no route for the 
given LID d ; 
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if the peer node corresponds to a switch, determining if the peer switch LID (LID sdest ) 
corresponds to the switch LID for which the forwarding table is being built (LID S ); 

if the peer switch LID (LID sdest ) corresponds to LID S , determining that the switch LID S 
and port LID dbase are directly connected, and getting the appropriate port number of switch LID S 
from the all port connectivity table; 

if the peer switch LID (LID sdest ) does not correspond to LID S , identifying all the links that 
are directly connected to other switches from the peer switch LID (LID sdesl ) and determining the 
number of (N) links that connect switch LID sdest where N is greater than M 0 ,T ; 

if the number of (N) links is not greater than "0", indicating that there is no route for the 
given LID d ; 

if the number of (N) links is greater than "0", setting the offset (n) of LID d from LID dbase , 
and determining if the number of multipaths (m) is less than the offset (n) of LID d from LID dbase 
and if the number of (N) links that connect switch LID sdest to other switch ports is less than the 
offset (n) of LID d from LID dbase ; 

if either the multipaths (m) or the (N) links is less than the offset (n) of LID d from 
LID dbase , indicating that there is no route for the given LID d ; 

if neither the multipaths (m) nor the (N) links is less than the offset (n) of LID d from 
LID dbase , storing the LIDs of switches to which N links are connected in a list O(i) and sorting the 
list by the port number of switch LID sdest from where each of N links originate in an ascending 
order; 
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setting the list 0(i).LID to the LID of the switch that the link 0(n) is connected at the 
other end, and checking if 0(i).LID corresponds to the switch LID for which the forwarding table 
is being built (LID S ); 

if the list 0(i).LID corresponds to the switch LID for which the forwarding table is being 
built (LID S ), setting an arbitrary LIDx as the LID of the switch to which the port LID d is directly 
connected (LID sdest ) and then getting the port number of switch LID S for the best route between 
switch LID S and LIDx from the all switch shortest paths table; 

if the list 0(i).LID does not correspond to the switch LID for which the forwarding table 
is being built (LID S ), setting an arbitrary LIDx as the 0(n).LID, and then getting the port number 
of switch LID S for the best route between switch LID S and LIDx from the all switch shortest 
paths table; and 

determining if the LID d is the last LID assigned to a port or switch in the subnet, and 
terminating computation of the forwarding table when LID d is the last LID assigned to a port or 
switch in the subnet. 

1 9. A computer readable medium comprising instructions that, when executed by a 
computer system, cause the computer system to: 

determine all possible links between all ports on a subnet including at least a host system, 
a target system and switches each having one or more ports interconnected via links during 
topology discovery; 



46 



219.39661X00 
LID#: 17318/P1 1067 

create an all port connectivity table which records all port-to-port connectivity 
information; 

create an all switch shortest paths table which records all the shortest paths between every 
port pair on the subnet based the port-to-port connectivity information; and 

compute forwarding tables for respective switches on the subnet that allow usage of 
multiple paths between port pairs based on the port-to-port connectivity information and based 
on the shortest paths between every port pairs. 

20. The computer readable medium as claimed in claim 19, further causing the 
computer system to: 

download the forwarding tables to respective switches on the subnet that allow usage of 
multiple paths between port pairs; and 

enable respective switches on the subnet to route data packets from the host system to the 
target system via multiple paths through the switched fabric. 

2 1 . The computer readable medium as claimed in claim 1 9, wherein each port on the 
subnet supports a unique 16-bit LID and a LID Mask Control (LMC) which specifies the number 
of low order bits of the LID to mask when checking a received destination LID against the port's 
destination LID. 
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22. The computer readable medium as claimed in claim 1 9, wherein said forwarding 
tables are computed to ensure loop-less paths and allow ports to be addressed by multiple local 
identifiers (LIDs), and wherein said all-port connectivity and all-switch shortest paths tables are 
constantly updated reflecting any dynamic changes to the subnet topology. 

23. The computer readable medium as claimed in claim 1 9, wherein said forwarding 
tables are computed based on the principle that only the shortest path between a given port pair is 
guaranteed to overlap with other shortest paths that either originate from or destined to some 
intermediate port that exists on the shortest path between the original port pair. 

24. The computer readable medium as claimed in claim 19, wherein a forwarding 
table for a switch is computed by: 

determining a destination switch to which a destination port is directly connected, 
identifying all the links that exist between the destination switch and other switches in the 

subnet; 

sorting all the links by respective originating port number in an ascending order; 
picking an appropriate link and identifying the switch to which the link is connected at 
the other end; 

determining the best route between the switch identified and the switch for which the 
forwarding table is being constructed; and 
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inputting associated outport number at a designated location in the forwarding table. 

25. The computer readable medium as claimed in claim 19, wherein said shortest 
paths between every switch pair are computed utilizing an All Pair Shortest Paths (APSP) 
algorithm, and each shortest path from the source to the destination switch is represented by a 
<Port, Cost> duple in which port is the port number of the source switch where the path 
originates and cost is the path cost metric that is computed based on a hop count, a message 
transfer (MTU) size, a link speed, width and other port and link characteristics. 
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